A system to create sets of semi-synthetic geo-spatial clusters
Abstract
Syndromic surveillance systems, especially software-based ones, have emerged as the leading outbreak detection mechanisms. Early outbreak detection systems can assist with medical and logistical decision support. A major obstacle to testing these systems effectively in practice is the scarcity of authentic outbreak health data. Because of this shortage, suitable geotemporal test clusters must be created to validate surveillance algorithms. This work describes an automated tool that creates artificial patient clusters by varying a wide range of realistic outbreak parameters. The cluster creation tool is an open-source program that accepts a set of outbreak parameters and creates artificial geospatial patient data for a single cluster or a series of similar clusters, which helps automate rigorous testing and validation of outbreak detection algorithms. Using the cluster generator, single patient clusters and series of patient clusters were created as files, and series of files, containing patient longitude and latitude coordinates. These clusters were then tested and validated using a publicly available GIS visualization program. All generated clusters fell within the ranges entered as parameters at program execution. Sample semi-synthetic datasets from the cluster creation tool were then used to validate a popular spatial outbreak detection algorithm, the M-Statistic.
Thesis Supervisor: Peter Szolovits
Title: Director, Clinical Decision Making Group, MIT Computer Science and Artificial Intelligence Laboratory
Research Supervisor: Kenneth Mandl
Title: Research Director, Center for Biopreparedness at Children's Hospital Boston
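The abstract does not specify the tool's parameter set or sampling scheme, so the following is only a minimal sketch of the general idea: turn a few outbreak parameters into artificial patient coordinates, either for one cluster or for a series of similar clusters. Every name here (`make_cluster`, `make_series`, `offset_km`, the radius and patient-count parameters) is hypothetical, and uniform sampling within a disc is an assumption chosen for simplicity, not the thesis's method.

```python
import math
import random

# Approximate length of one degree of latitude in kilometres.
KM_PER_DEG_LAT = 111.32


def make_cluster(center_lat, center_lon, radius_km, n_patients, rng):
    """Return n_patients (lat, lon) pairs placed uniformly inside a disc.

    Hypothetical parameters: cluster centre, radius, and patient count.
    Uses a flat-earth approximation, reasonable only for small radii.
    """
    km_per_deg_lon = KM_PER_DEG_LAT * math.cos(math.radians(center_lat))
    points = []
    for _ in range(n_patients):
        # sqrt() makes the density uniform over the disc's area.
        r = radius_km * math.sqrt(rng.random())
        theta = 2.0 * math.pi * rng.random()
        lat = center_lat + (r * math.sin(theta)) / KM_PER_DEG_LAT
        lon = center_lon + (r * math.cos(theta)) / km_per_deg_lon
        points.append((lat, lon))
    return points


def make_series(center_lat, center_lon, radii_km, n_patients, seed=0):
    """Return one cluster per radius, i.e. a series of similar clusters."""
    rng = random.Random(seed)  # fixed seed keeps test sets reproducible
    return [
        make_cluster(center_lat, center_lon, radius, n_patients, rng)
        for radius in radii_km
    ]


def offset_km(center_lat, center_lon, lat, lon):
    """Distance from the centre under the same flat-earth approximation."""
    dy = (lat - center_lat) * KM_PER_DEG_LAT
    dx = (lon - center_lon) * KM_PER_DEG_LAT * math.cos(math.radians(center_lat))
    return math.hypot(dx, dy)


# Example: three clusters of 25 patients with growing radius (made-up values).
series = make_series(42.34, -71.10, radii_km=[1.0, 2.5, 5.0], n_patients=25)
```

A check in the spirit of the validation described above is to confirm that every generated point lies within the radius entered for its cluster, for example with `offset_km`.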
Similar resources
Developing a Recommendation Framework for Tourist by Mining Geo-tag Photos (Case Study Tehran District 6)
With the growing popularity of sharing media on social networks and easier access to location technologies such as the Global Positioning System (GPS), people are increasingly interested in sharing their own photos and videos. World Wide Web users are no longer only consumers of information but also producers of it, so a wealth of information is available in Web 2.0 applications. The ...
Clustering and visualization of non-classified points from LiDAR data for helicopter navigation
In this paper we propose a dynamic DBSCAN-based method to cluster and visualize unclassified and potentially dangerous obstacles in data sets recorded by a LiDAR sensor. The sensor delivers data sets in a short time interval, so a spatial superposition of multiple data sets is created. We use this superposition to create clusters incrementally. Knowledge about the position and size of each cluste...
Developing 3 dimensional model for estimation of acoustic power in urban pathways in geo-spatial information system framework
Around the world, traffic growth is causing growing air and noise pollution. Noise levels in a given area are affected by street traffic as well as other factors, including existing infrastructure and industrial centers. The purpose of this research is to model and estimate the amount of acoustic emission in the streets of Tehran's third district, using the 3D spatial info...
A software tool for creating simulated outbreaks to benchmark surveillance systems
BACKGROUND: Evaluating surveillance systems for the early detection of bioterrorism is particularly challenging when systems are designed to detect events for which there are few or no historical examples. One approach to benchmarking outbreak detection performance is to create semi-synthetic datasets containing authentic baseline patient data (noise) and injected artificial patient clusters, as...
A new method to consider spatial risk assessment of cross-correlated heavy metals using geo-statistical simulation
Soil samples were collected from 170 sampling stations in an arid area in Shahrood and Damghan characterized by prevalent mining activity. The levels of Co, Pb, Ni, Cs, Cu, Mn, Sr, V, Zn, Cr, and Tl were recorded at each sampling location. A new method known as min/max autocorrelation factor (MAF) was applied for the first time in environmental research to de-correlate these ...